The Advantage of Cross Entropy over Entropy in Iterative Information Gathering
Abstract
Gathering the most information from the least amount of data is a common task in experimental design and when exploring an unknown environment in reinforcement learning and robotics. A widely used measure for quantifying the information contained in a distribution of interest is its entropy. Greedily minimizing the expected entropy is therefore a standard method for choosing samples in order to gain strong beliefs about the underlying random variables. We show that this approach is prone to temporarily getting stuck in local optima corresponding to wrongly biased beliefs. We suggest instead maximizing the expected cross entropy between old and new belief, which aims at challenging refutable beliefs and thereby avoids these local optima. We show that both criteria are closely related and that their difference can be traced back to the asymmetry of the Kullback-Leibler divergence. In illustrative examples as well as simulated and real-world experiments, we demonstrate the advantage of cross entropy over simple entropy for practical applications.
Keywords: Information gain · Experimental design · Exploration · Active learning · Cross entropy · Robotics
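The two selection criteria named in the abstract can be sketched for a small discrete belief. This is only an illustration under stated assumptions, not the authors' implementation: the function names, the toy prior, and the likelihood tables are hypothetical, the hypothesis space is assumed finite, and the cross entropy is taken in the direction H[old; new] = -sum_h old(h) log new(h), which is one plausible reading of "cross entropy between old and new belief".

```python
import numpy as np

def posterior(prior, lik_y):
    """Bayes update; lik_y[h] = p(y | h, query) for one outcome y."""
    joint = prior * lik_y
    return joint / joint.sum()

def entropy(p):
    """Shannon entropy in nats, ignoring zero-probability entries."""
    p = p[p > 0]
    return float(-(p * np.log(p)).sum())

def cross_entropy(p, q):
    """H[p; q] = -sum_h p(h) log q(h); assumes q > 0 wherever p > 0."""
    mask = p > 0
    return float(-(p[mask] * np.log(q[mask])).sum())

def score_query(prior, lik):
    """lik[y, h] = p(y | h, query).

    Returns (expected posterior entropy, expected cross entropy between
    the old belief and the new belief), both averaged over outcomes y.
    """
    exp_entropy, exp_cross = 0.0, 0.0
    for lik_y in lik:
        p_y = float((prior * lik_y).sum())  # predictive probability of y
        if p_y == 0.0:
            continue
        post = posterior(prior, lik_y)
        exp_entropy += p_y * entropy(post)
        exp_cross += p_y * cross_entropy(prior, post)
    return exp_entropy, exp_cross

# Hypothetical toy problem: two hypotheses, a strongly biased belief.
prior = np.array([0.9, 0.1])
# A query whose outcome does not depend on the hypothesis.
uninformative = np.array([[0.5, 0.5], [0.5, 0.5]])
# A query whose outcome distinguishes the two hypotheses.
informative = np.array([[0.9, 0.2], [0.1, 0.8]])

h_prior = entropy(prior)
h_unf, ce_unf = score_query(prior, uninformative)
h_inf, ce_inf = score_query(prior, informative)
print("prior entropy:", h_prior)
print("uninformative query:", h_unf, ce_unf)
print("informative query:  ", h_inf, ce_inf)
```

Under this reading, the entropy criterion picks the query with the smallest first score and the cross entropy criterion picks the query with the largest second score. Since H[old; new] equals H[old] plus KL(old || new) and H[old] does not depend on the query, the second score ranks queries by the expected KL divergence from the old to the new belief, while the expected reduction in entropy equals the expected KL divergence in the opposite direction. This is where the asymmetry mentioned in the abstract enters.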